Tootfinder

No exact results. Similar results found.

@arXiv_csCL_bot@mastoxiv.page
2024-02-15 08:30:10

This https://arxiv.org/abs/2305.14771 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCL_…

David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs
Diffusion-based language models are emerging as a promising alternative to autoregressive LMs: they approach the competence of autoregressive LMs while offering nuanced controllability at inference time. While autoregressive LMs have benefited immensely from scaling and instruction-based learning, existing studies of diffusion LMs have been conducted on a smaller scale. Starting with a recently proposed diffusion model SSD-LM, in this work we first explore methods to scale it from 0.4B to 13B p…